NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Item Response Theory—A Statistical Framework for Educational and Psychological Measurement

https://doi.org/10.1214/23-STS896

Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Ying, Zhiliang (May 2025, Statistical Science)

Free, publicly-accessible full text available May 1, 2026
A Note on Entrywise Consistency for Mixed-data Matrix Completion

Chen, Yunxiao; Li, XIaoou (August 2024, Journal of machine learning research)

Full Text Available
Variable selection in latent variable models via knockoffs: an application to international large-scale assessment in education

https://doi.org/10.1093/jrsssa/qnad137

Xie, Zilong; Chen, Yunxiao; von_Davier, Matthias; Weng, Haolei (December 2023, Journal of the Royal Statistical Society Series A: Statistics in Society)

Abstract International large-scale assessments (ILSAs) play an important role in educational research and policy making. They collect valuable data on education quality and performance development across many education systems, giving countries the opportunity to share techniques, organisational structures, and policies that have proven efficient and successful. To gain insights from ILSA data, we identify non-cognitive variables associated with students’ academic performance. This problem has three analytical challenges: (a) academic performance is measured by cognitive items under a matrix sampling design; (b) there are many missing values in the non-cognitive variables; and (c) multiple comparisons due to a large number of non-cognitive variables. We consider an application to the Programme for International Student Assessment, aiming to identify non-cognitive variables associated with students’ performance in science. We formulate it as a variable selection problem under a general latent variable model framework and further propose a knockoff method that conducts variable selection with a controlled error rate for false selections.
more » « less
Full Text Available
DIF Statistical Inference Without Knowing Anchoring Items

https://doi.org/10.1007/s11336-023-09930-9

Chen, Yunxiao; Li, Chengcheng; Ouyang, Jing; Xu, Gongjun (December 2023, Psychometrika)

Abstract Establishing the invariance property of an instrument (e.g., a questionnaire or test) is a key step for establishing its measurement validity. Measurement invariance is typically assessed by differential item functioning (DIF) analysis, i.e., detecting DIF items whose response distribution depends not only on the latent trait measured by the instrument but also on the group membership. DIF analysis is confounded by the group difference in the latent trait distributions. Many DIF analyses require knowing several anchor items that are DIF-free in order to draw inferences on whether each of the rest is a DIF item, where the anchor items are used to identify the latent trait distributions. When no prior information on anchor items is available, or some anchor items are misspecified, item purification methods and regularized estimation methods can be used. The former iteratively purifies the anchor set by a stepwise model selection procedure, and the latter selects the DIF-free items by a LASSO-type regularization approach. Unfortunately, unlike the methods based on a correctly specified anchor set, these methods are not guaranteed to provide valid statistical inference (e.g., confidence intervals andp-values). In this paper, we propose a new method for DIF analysis under a multiple indicators and multiple causes (MIMIC) model for DIF. This method adopts a minimal$$L_1$$ $L_{1}$ norm condition for identifying the latent trait distributions. Without requiring prior knowledge about an anchor set, it can accurately estimate the DIF effects of individual items and further draw valid statistical inferences for quantifying the uncertainty. Specifically, the inference results allow us to control the type-I error for DIF detection, which may not be possible with item purification and regularized estimation methods. We conduct simulation studies to evaluate the performance of the proposed method and compare it with the anchor-set-based likelihood ratio test approach and the LASSO approach. The proposed method is applied to analysing the three personality scales of the Eysenck personality questionnaire-revised (EPQ-R).
more » « less
Full Text Available
Discussion: “Vintage Factor Analysis with Varimax Performs Statistical Inference” by Rohe and Zeng

https://doi.org/10.1093/jrsssb/qkad040

Chen, Yunxiao; Xu, Gongjun (April 2023, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Full Text Available
Asymptotically Optimal Sequential Design for Rank Aggregation

https://doi.org/10.1287/moor.2021.1209

Chen, Xi; Chen, Yunxiao; Li, Xiaoou (August 2022, Mathematics of Operations Research)

A sequential design problem for rank aggregation is commonly encountered in psychology, politics, marketing, sports, etc. In this problem, a decision maker is responsible for ranking K items by sequentially collecting noisy pairwise comparisons from judges. The decision maker needs to choose a pair of items for comparison in each step, decide when to stop data collection, and make a final decision after stopping based on a sequential flow of information. Because of the complex ranking structure, existing sequential analysis methods are not suitable. In this paper, we formulate the problem under a Bayesian decision framework and propose sequential procedures that are asymptotically optimal. These procedures achieve asymptotic optimality by seeking a balance between exploration (i.e., finding the most indistinguishable pair of items) and exploitation (i.e., comparing the most indistinguishable pair based on the current information). New analytical tools are developed for proving the asymptotic results, combining advanced change of measure techniques for handling the level crossing of likelihood ratios and classic large deviation results for martingales, which are of separate theoretical interest in solving complex sequential design problems. A mirror-descent algorithm is developed for the computation of the proposed sequential procedures.
more » « less
Full Text Available
Optimal Parallel Sequential Change Detection Under Generalized Performance Measures

https://doi.org/10.1109/TSP.2022.3231521

Lu, Zexian; Chen, Yunxiao; Li, Xiaoou (January 2022, IEEE Transactions on Signal Processing)

Full Text Available
Unfolding-Model-Based Visualization: Theory, Method and Applications

Chen, Yunxiao; Ying, Zhiliang; Zhang, Haoran (January 2021, Journal of Machine Learning Research)
Eliassi-Rad, Tina (Ed.)
Multidimensional unfolding methods are widely used for visualizing item response data. Such methods project respondents and items simultaneously onto a low-dimensional Eu- clidian space, in which respondents and items are represented by ideal points, with person- person, item-item, and person-item similarities being captured by the Euclidian distances between the points. In this paper, we study the visualization of multidimensional unfold- ing from a statistical perspective. We cast multidimensional unfolding into an estimation problem, where the respondent and item ideal points are treated as parameters to be esti- mated. An estimator is then proposed for the simultaneous estimation of these parameters. Asymptotic theory is provided for the recovery of the ideal points, shedding lights on the validity of model-based visualization. An alternating projected gradient descent algorithm is proposed for the parameter estimation. We provide two illustrative examples, one on users’ movie rating and the other on senate roll call voting.
more » « less
Full Text Available
Optimal Stopping and Worker Selection in Crowdsourcing: an Adaptive Sequential Probability Ratio Test Framework

https://doi.org/10.5705/ss.202018.0300

Li, Xiaoou; Chen, Yunxiao; Chen, Xi; Liu, Jingchen; Ying, Zhiliang (January 2021, Statistica Sinica)

Full Text Available
A Note on Exploratory Item Factor Analysis by Singular Value Decomposition

https://doi.org/10.1007/s11336-020-09704-7

Zhang, Haoran; Chen, Yunxiao; Li, Xiaoou (June 2020, Psychometrika)
null (Ed.)
Abstract We revisit a singular value decomposition (SVD) algorithm given in Chen et al. (Psychometrika 84:124–146, 2019b) for exploratory item factor analysis (IFA). This algorithm estimates a multidimensional IFA model by SVD and was used to obtain a starting point for joint maximum likelihood estimation in Chen et al. (2019b). Thanks to the analytic and computational properties of SVD, this algorithm guarantees a unique solution and has computational advantage over other exploratory IFA methods. Its computational advantage becomes significant when the numbers of respondents, items, and factors are all large. This algorithm can be viewed as a generalization of principal component analysis to binary data. In this note, we provide the statistical underpinning of the algorithm. In particular, we show its statistical consistency under the same double asymptotic setting as in Chen et al. (2019b). We also demonstrate how this algorithm provides a scree plot for investigating the number of factors and provide its asymptotic theory. Further extensions of the algorithm are discussed. Finally, simulation studies suggest that the algorithm has good finite sample performance.
more » « less
Full Text Available

« Prev Next »

Search for: All records